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Abstract 

This paper describes Postfix-GP system, postfix notation 
based Genetic Programming (GP), for solving symbolic 
regression problems. It presents an object-oriented archi¬ 
tecture of Postfix-GP framework. It assists the user in 
understanding of the implementation details of various 
components of Postfix-GP. Postfix-GP provides graphi¬ 
cal user interface which allows user to configure the ex¬ 
periment, to visualize evolved solutions, to analyze GP 
run, and to perform out-of-sample predictions. The use 
of Postfix-GP is demonstrated by solving the benchmark 
symbolic regression problem. Finally, features of Postfix- 
GP framework are compared with that of other GP sys¬ 
tems. 

Keywords- Postfix Genetic Programming; Postfix-GP 
Framework; Object Oriented Design; GP Software Tool; 
Symbolic Regression 

1 Introduction 

Evolutionary Algorithms (EA) are a group of computa¬ 
tional techniques, which employ the theory of natural se¬ 
lection to a population of individuals to generate better 
individuals. Genetic Programming (GP) is a paradigm of 
EA which uses hierarchical, tree structure, variable length 
representation to code solutions of a problem. GP can be 
used to intelligently search the solution space for finding 
the optimal solution of a problem. 

There are many GP tools (lil-gp, GeneXproTools, GPLab, 
ECJ, Open BEAGLE) [lj [21 Ej 3J [5] developed by GP 
practitioners. However, none of these address the follow¬ 
ing demands of end user before applying them to solve 


symbolic regression problems: (i) ease of use and (ii) 
small learning curve. Many of these tools are open source 
and available freely (lil-gp, ECJ, Open BEAGLE, GPLab 
for MATLAB) pQ H 0 0 whereas the rest are available 
commercially (GeneXproTools) [2j. Many of these tools 
necessitate modification in source code in order to gener¬ 
ate required experimental environment. Determining the 
final solution, produced by these tools, demands transla¬ 
tion of the output or digging the log files. Due to these 
reasons, the interest of researchers and engineers in GP 
may get reduced. Theses reasons motivated us to develop 
our own GP framework [B] which uses the postfix notation 
for an individual representation. 

We have considered the following features which need 
to be supported by Postfix-GP framework: (i) easy to ex¬ 
tend, (ii) simple and quick procedure for the configuration 
and running of GP, (iii) a set of algorithm implementation 
for: (a) generating the initial population, (b) selection 
mechanisms, and (c) genetic operators, (iv) visualization 
of: (a) Postfix-GP run analysis and (b) evolved solution 
with statistical measures, (v) one-step and multi-step pre¬ 
diction support, (vi) visualization of results for one-step 
and multi-step predictions, and (vii) storage and retrieval 
of evolved solutions to and from file. Postfix-GP has been 
used in experimental work El, H, 0, 0- 

This paper presents the design and implementation of 
Postfix-GP, an object oriented software framework for ge¬ 
netic programming. Section [2] gives introduction to GP. 
Section[3]presents the design of Postfix-GP. Moreover, the 
section also gives the implementation details and main 
features of Postfix-GP. Section [4] presents Postfix-GP as 
a solution modeling tool by solving the benchmark sym¬ 
bolic regression problem. Section[5]compares the features 
of Postfix-GP with lil-gp |Tj, ECJ |4], and JCLEC [TD) 
frameworks. This is followed by conclusions in Section [6] 
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Postfix-GP 


2 Introduction to Genetic Program¬ 
ming 

Standard GP EH employs a variable length, tree struc¬ 
ture scheme for an individual representation. The tree 
can be used to represent logical expressions (IF-THEN- 
ELSE), boolean expressions (AND,OR,NOT) or algebraic 
expressions. The symbolic regression aims to find the 
functional relationship (mathematical expression) between 
given instances of inputs-outputs. GP can be used to per¬ 
form Symbolic Regression (SR). When using GP for solv¬ 
ing symbolic regression problems, the user need to specify 
the following items: (i) GP configuration parameters, (ii) 
terminal set and function set, (iii) fitness function. 



Figure 1: Flowchart of Genetic Programming. 

The main steps of standard GP m are as follows: 

1. random generation of an initial population of can¬ 
didate solutions in tree form using the elements of 
function set and terminal set, selected by the mod¬ 
eler (user). 

2. calculating fitness value of every individual of the 
population on the given training dataset (fitness 
cases) 

3. selecting parents for mating based on the calculated 
fitness values, determined in previous step 

4. applying sub-tree crossover and mutation (genetic 
operators) on selected parents for generating a new 
population of individuals. 

The process is repeated until the termination condi¬ 
tion is fulfilled. 
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3.1 Features included in the Design of 
Postfix-GP 

The important features of the proposed Postfix-GP are 
categorized into: (i) training dataset, function set, and 
terminal set related, (ii) GP parameters related, (iii) test 
dataset prediction related, (iv) GP run analysis related, 
and (v) serialization and de-serialization of GP experi¬ 
ments and results. 

Training dataset, function set, and terminal set re¬ 
lated features: 

• Loading of training dataset 

• Loading of binary and unary functions 

• Loading of constants 
GP parameters related: 

• Selection of method to generate initial population 

• Configuration of GP parameters like population size, 
number of generations, crossover rate, mutation rate 

• Selection of crossover and mutation type 

• Selection of selection scheme 
Test dataset prediction related: 

• Loading out-of-sample test dataset for one-step pre¬ 
dictions 

• Visualization of results of one-step predictions with 
statistical measures 

• Loading out-of-sample test dataset for multi-step 
predictions 

• Visualization of results of multi-step predictions with 
statistical measures 

GP run analysis related: 

• Plotting best adjusted fitness vs number of genera¬ 
tions 

• Plotting average adjusted fitness vs number of gen¬ 
erations 

• Plotting solution size vs number of generations 

Serialization and de-serialization of GP experiments 
and results: 
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• Serialization of GP parameters, function set, termi¬ 
nal set, and obtained solutions to a file 

• De-serialization of GP parameters, function set, ter¬ 
minal set, and obtained solutions from a file 

3.2 Implementation of Postfix-GP 

This section presents the implementation details of Postfix- 
GP. This will be useful to the reader in understanding and 
customizing the proposed Postfix-GP framework. For de¬ 
tails related to Postfix-GP solution representation scheme, 
refer 0 , 0 - Postfix-GP framework is developed using 
Microsoft .NET framework [12] on Windows XP operat¬ 
ing system. The ZedGraph [13] class library is used for 
plotting the charts. ZedGraph is an open source graph 
library for .NET platform. 

The class diagram for Postfix-GP is depicted in Figure [2] 
The classes can be grouped into following categories: (i) 
representation (Genome, EquationGenome), (ii) popula¬ 
tion (Population), (iii) crossover operator (BaseCrossover, 
GA-like, Sub-tree, Semantic aware sub-tree), (iv) mutation 
operator (BaseMutation, Fully Protected, Partially Protected), 
(v) selection schemes (BaseSelection, Roulette-wheel, Tour¬ 
nament, Parsimony Pressure), (vi) GP parameters (GP Pa¬ 
rameters), and (vii) statistical analysis of results (GP Run 
Results). 

3.3 Individual, Population and Fitness 

In Postfix-GP, an individual represents a candidate solu¬ 
tion for a problem. We can create an instance of indi¬ 
vidual using EquationGenome class, which is derived from 
an abstract class called Genome. To generate an individ¬ 
ual, we can use two-argument constructor, which returns 
a new individual having ValidLength in between Min- 
Length and MaxLength. 

public class EquationGenome() { 
public static int MinLength; 
public static int MaxLength; 
private int ValidLength; 
private ArrayList TheArray = new ArrayList(); 
private int Rawfitness; 
private int Adjustedfitness; 

public EquationGenome(int Minlength, int Maxlength) 

I 

An individual comprises a genotype and a phenotype. 
The genotype is of type array of size equals to MaxLength 
specified by the user. In addition to the genetic material, 
an individual also contains raw fitness and adjusted fit¬ 
ness. The raw fitness is not range bound whereas the 
adjusted fitness can take values in the range of 0 to 1. 


In Postfix-GP, an instance of a population can be cre¬ 
ated using the class Population. An instance of Popula¬ 
tion contains a collection of instances of EquationGenome. 
Individuals can be added to and removed from the popu¬ 
lation (collection) using the iterator. Iterator is useful to 
iterate over a collection in sequential order. To create an 
instance of a population, several Postfix-GP parameters 
need to be passed. Following constructor can be used for 
the same. 

public Population(CurrentDirectory, NoofGeneration, 
NoofGenerationperCascade, PopulationSize, MinLength, 
MaxLength, MutationFrequency, CrossoverType, Cross- 
overFrequency, SelectionStrategy, Intervalarithmetic, 
SemanticSensitivity, InitialPopulationType, MutationType) 

The number of elites of a population can be set using 
archivepopulationsize parameter. By default, the archive 
size is set to 1/10 of population size. The information 
related to an instance of a Population can be printed to a 
log file in human interpretable form. The logged popula¬ 
tion information is presented below. It begins by printing 
the number of generation, then details of individuals of 
current population and archive. For each individual it 
prints the genotype representation, its ValidLength and 
adjusted fitness value. 

Generation 0 
Population 

a#-0.6#a#a#*#*#-#a#-0.3#0.8#+#-#*# Ad- 
jFit -> 0.0011 -> ValPos -> 13 

a#a#-0.8#-#*#a#/#-0.2#a#0.3#-#*#+#^ AdjFit 
-> 0.0010 ValPos -> 13 

a#-0.6#a#a#*#-0.8#a#a#*#*#*#+#-#-0.7#+#^ 
AdjFit ->• 0.0002 ->• ValPos ->• 15 
a#a#-0.4#a#*#-#a#0.1#S#+#*# AdjFit 0.0003 

—>• ValPos —> 12 

Archive 

a#a#-0.4#a#*#-#a#0.1#S#+#*# AdjFit 0.0003 

—>■ ValPos —> 12 

a#-0.6#a#a#*#-0.8#a#a#*#*#*#+#-#-0.7#+#^ 

AdjFit -» 0.0002 -> ValPos 15 

Finally, a fitness value is assigned to an individual by 
evaluating it on training dataset. A code snippet for cal¬ 
culating a fitness of an individual is presented here. The 
code checks for type of terminal before performing any 
operation. 

public float PostFixCalculation(float[] inputs) { 
equationstack.ClearQ; 
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Figure 2: Class diagram for Postfix-GP. 


for (int i = 0; i < this.ValidLength; i++) { 
if ((int)TheArray[i] < Operandcount) { 

// Push a single input value on stack 
equationstack.Push(inputs[(int)TheArray[i]]); 

} 

else if ((int)TheArray[i] < BinaryoperatorRange) { 
if (equationstack.Count > 0) { 
y = (float)equationstack.Pop(); 
if (equationstack.Count > 0) { 
x = (float)equationstack.Pop(); 

result=DoOperation(x,y,Binaryoperators[(int)TheArray[i] — 
Opera ndRange].ToString()); 
if(double.lsPositivelnfinity(result) 
||double.lsNegativelnfinity(result)) { 
return result; 

} 

else 

equationstack. Push (result); 

} 

} 

} 

else if ((int)TheArray[i] < UnaryoperatorRange) { 


} 

} 

} 


3.4 Selection Methods 

The task of selection mechanism is to select individu¬ 
als from current population and previous (old) archive. 


Postfix-GP permits use of different selection mechanism 
for population and archive. All implemented classes for 
selection mechanisms are derived from abstract class BaseS- 
election. The derived classes, Roulettewheel, Tournament, 
and Parsimonypressure, are required to implement the fol¬ 
lowing two methods: Select() and setlndividuals(ArrayList 
individuals). Each time the Select() method is called, it 
returns the index value. The individual at the given in¬ 
dex value in the current population or archive is selected 
as one of the parents for crossover. The implementation 
of SelectQ method differs from one selection mechanism 
to another. 

setlndividuals() method is used to fix whether the individ¬ 
uals are selected from the current population or from the 
old archive. 

public int Select(); 

public void setlndividuals(ArrayList individuals); 

3.5 Genetic Operators 

3.5.1 Crossover 

Postfix-GP provides implementation for following three 
types of crossover operators: (i) GA-like one-point crossover 
m, ( ii) sub-tree crossover El: and (iii) semantic aware 
sub-tree crossover EE]. The selection of crossover type 
is done by the user through GUI. The base class for all 
crossovers is BaseCrossover. The derived classes, GAlike, 
Subtree, and Semantic aware subtree, are required to im¬ 
plement the CrossoverQ method. The maximum number 
of trials parameter, used by semantic aware crossover, is 
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by default set to value 20. 

public abstract class BaseCrossOver { 
private int MaxCrossoverCount = 20; 
abstract public Genome[] Crossover(Genome genel, Genome 
gene2); 

} 

3.5.2 Mutation 

Postfix-GP provides implementation for following two types 
of mutation operators: (i) fully protected and (ii) par¬ 
tially protected. The fully protected mutation selects 
an index value within the range [1, ValidLength] and 
changes the element positioned at the selected index value 
to another element of the same arity. The partially pro¬ 
tected mutation selects an index value within the range 
[1, Max Length]. If the selected index value is less than 
MinLength then it applies fully protected mutation (an 
element positioned at selected index is replaced with an¬ 
other element of the same arity only) else it does not ap¬ 
ply this constraint (an element positioned at selected in¬ 
dex can be replaced with any other element having either 
same or different arity). The selection of mutation type is 
done by the user through GUI. The base class for all types 
of mutation is BaseMutation. The derived classes, Ful- 
lyProtected and PartiallyProtected, are required to imple¬ 
ment Mutate() method. The value of MaxMutationCount 
variable decides how many times Postfix-GP should try 
to ensure that the mutation operator generates a different 
gene than the original value. The default value for Max¬ 
MutationCount is set to value 10. The default value for 
mutation probability of operators (functions set) is set to 
value 0.6. The code snippet for abstract class BaseMuta¬ 
tion and mutate() method, implemented by FullyProtected 
class, are presented below. 

public abstract class BaseMutation { 
public int MaxMutationCount = 10; 
private double operatormutatefrequency = 0.6; 
abstract public EquationGenome Mutate (Equation- 
Genome gene); 

} 

3.6 Sub-trees as Solutions 

Postfix-GP derives all sub-trees of an individual having 
the Validlength greater than the MinLength and treats 
the extracted sub-trees as separate solutions (individu¬ 
als) during the evolutionary process. This approach is 
useful to GP for exploring large solution search space and 


improving GP performance. Moreover, Postfix-GP uses 
elitism operator to preserve good solutions from one gen¬ 
eration to the next generation without being affected by 
the destructive nature of crossover and mutation. Elitism 
also provides good building blocks for producing better 
solutions in successive generations. Postfix-GP preserves 
a set of highly fit solutions separately from the main pop¬ 
ulation. Postfix-GP implements elitism using a fixed size 
archive. An archive is used to preserve the good solutions, 
found so far. 

3.7 Serialization/De-serialization of Pop¬ 
ulation and GP-Parameters 

Postfix-GP provides GUI for storing and retrieving of the 
population of individuals as well as GP-parameters from 
file system. The information about the final population 
and GP parameters are stored in binary format. The 
Serialize!) and Deserialize() methods of BinaryFormatter 
class are used for storing and retrieving object informa¬ 
tion. 

3.8 Statistics 

Postfix-GP provides a facility to collect statistical de¬ 
tails at every generation of a GP run. It stores data 
for individuals of both population and an archive into an 
excel hie. It records following details for individuals of 
an archive per generation: (i) adjusted fitness value of 
the best individual of current generation, (ii) size of the 
best individual (solution), (iii) average adjusted fitness of 
archive, (iv) average node (element) count of archive, (v) 
best individual found so far, (vi) size of the best individ¬ 
ual found so far, and (vii) Mean Absolute Error (MAE), 
Normalized Mean Square Error (NMSE), and Correla¬ 
tion Coefficient (CC) values for the best solution found 
so far. WritePopulationAnalysisDataQ and WriteArchive- 
AnalysisData() methods of PostfixGPGUI are used to store 
these data into the hie. The mentioned statistical infor¬ 
mation are used to produce the following graphs: 

• Average adjusted htness vs generations 

• Average node count vs generations 

• Best adjusted htness vs generations 

• Average adjusted htness, average node count, and 
best adjusted htness vs generations 
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3.9 Implementation of GUI 

The class responsible for providing functionalities of GUI 
is named PostfixGPGUI. It is used to collect training dataset, 
function and terminal set, and Postfix-GP configuration 
parameters, provided by the user, and passing these data 
to Postfix-GP core. The GUI is useful to extend the func¬ 
tionalities of Postfix-GP core. The user can directly load 
the training dataset from stored comma separated (.csv) 
file. The LoadDataFile() method of PostfixGPGUI class is 
used to load the training dataset. The LoadDataFileQ in¬ 
ternally uses getColumnNamesQ, getColumn(), and getRowQ 
methods of FileOperation class to determine the header 
of columns, number of columns, and number of rows in 
training dataset at run time. Then it calls getDataQ 
method of FileOperation class to load the training dataset 
into rows of DataGridView. 

At the end of Postfix-GP run, GUI displays the best 
evolved solutions. The user can visualize the statistical 
details of any of the evolved solutions by pressing mouse 
button on the selected solution. The corresponding refer¬ 
ence of an instance of EquationGenome is obtained from 
the current instance of Population. The EquationGenome 
reference is used to invoke GetRawfitness(), GetAdjusted- 
fitness(), GetCorrelationCoefficient(), CalculateNMSEFitnessQ, 
GetValidLength() methods of EquationGenome class which 
returns the rawfitness, adjustedfitness, r, NMSE, and size 
of solution measures for the selected solution. Then, 
the instance of Graphs is created. The values for men¬ 
tioned measures are passed as arguments of DrawEqua- 
tionMethod() method of Graphs class. The DrawEqua- 
tion() and DrawGraph() methods of Graphs class are use¬ 
ful to plot: (i) solution details and (ii) analysis of results 
of Postfix-GP run. 

4 Case Study 

We have taken the benchmark symbolic regression prob¬ 
lem, solved using GEP approach in [15]. Equation(l) is 
used to generate a set of twenty one equidistant points 
in range [-10:1:10]. These points are used as fitness cases 
(training dataset). 

y = 3 * (x + l) 3 + 2 * (x + l) 2 + (x + 1), (1) 

The population size and the number of generations 
are set to values 50 and 200. The crossover and mutation 
rates are set to values 0.9 and 0.1. The MinLength and 
MaxLength parameters of Postfix-GP are set to values 15 
and 35. The terminal set T consists of the independent 
variable, x, and a list of constants, T = { x , list of 



Figure 3: Postfix-GP GUI for Experimentation 



Figure 4: Postfix-GP GUI for analysis of GP run 


constants }. The selected constants values are {1,2,3,5,7}. 
The function set F consists of {+, -,*,/}. 

The left hand side panel of Postfix-GP GUI (Figure[3]) 
provides the facility to load the training dataset, the func¬ 
tion set, and the terminal set from the files. The accepted 
file format is comma separated values (.csv). The train¬ 
ing dataset can be loaded from a file by activating Load 
Fitness Cases button of GUI. The function set and the 
terminal set can be loaded by activating Load Operators 
and Load Constants buttons of GUI. The right hand side 
panel of Postfix-GP GUI (Figure [3]) provides facility to 
configure Postfix-GP parameters. 
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Postfix-GP fit for Training dataset 



Figure 5: Postfix-GP GUI showing statistical measures 
for the evolved solution for training data 


y = (x + (((((x * (5 + (x + (2 * x)))) * (x + 2)) + (5/5* 
(5/5)))) — 5) + (x + (2 * (x + 5))))) (2) 

The separate GUIs are provided for visualization of: 

(i) evolved solution with its statistical measures, (ii) Postfix- 
GP run analysis, and (iii) Prediction of out-of-sample 
datapoints. Figure [5] presents visual representation of 
evolved solution with its statistical information. Equa¬ 
tion ([2]) represents one of the best solutions found by 
Postfix-GP. Simplifying this equation results in equation 

0 . 

Figure [5] depicts the comparison of actual function and 
the function modeled by the evolved Postfix-GP solution. 
The evolved solution has structural complexity (number 
of nodes) of 33. The statistical measures for the evolved 
solution for training dataset are as follows: MAE = 0, 
NMSE = 0, AdjustedFitness = 1, and r = 1. Figure [3] 
presents the visual representation of Postfix-GP run anal¬ 
ysis over 50 generations. It displays the plots of the av¬ 
erage best fitness vs generation and the average solution 
size vs generation. 

Figure [6] shows out-of-sample (test data) one-step ahead 
predictions obtained using the evolved solution for the 
values of x in the range [11:1:20]. The statistical mea¬ 
sures for one-step predictions are as follows: MAE = 0, 
NMSE = 0, AdjustedFitness = 1, and r = 1. The error 
of zero value suggest that the one-step ahead predictions 
are very good. 


SoUon |(x+((((( x*|5-«-(x-(2*x))))*(x+2))+(5/(5*(5/5))))-5)-(x-*-(2*(x-5))))) 



One-step ahead Prediction 



Input Points 


Figure 6: Postfix-GP GUI for one-step prediction of out- 
of-sample data 

5 Features Comparison of Postfix- 
GP with lil-gp, ECJ, and JCLEC 

We have compared the features of Postfix-GP with lil- 
gp [I], ECJ 0|, and JCLEC [ID] GP systems. We have 
selected these GP systems because they are open source 
and freely available. Most of these systems do not pro¬ 
vide easy to use functionality to end users. These systems 
can be used by persons having experience in the held of 
the evolutionary computing. Many systems are dedicated 
to a particular flavor of EC and others are generic (can 
be used for different flavors of EC). For example, lil-gp 
[H provides support for GP only, whereas ECJ [4] and 
JCLEC [TO] provide support for Genetic Algorithm (GA) 
[T5] , Genetic Programming (GP) (TT|, and Gene Expres¬ 
sion Programming (GEP) [17] . However, for comparison 
of these systems with Postfix-GP, we have only consid¬ 
ered GP flavor of these systems. Table [I] presents the 
features comparison of Postfix-GP with these systems. 

lil-gp: lil-gp T] is a GP toolkit implemented in C pro¬ 
gramming language. The toolkit is efficient and fast, as it 
is implemented in C. However, it is difficult to extend the 
toolkit compare to other object-oriented implementations 
of GP systems, lil-gp uses only tree structure for an in¬ 
dividual representation and provides limited fitness mea¬ 
sures. Moreover, the toolkit does not provide graphical 
user interface to read an input (training) data hie. The 
toolkit uses a parameter hie to load GP parameters. The 
toolkit produces six reporting hies (.sys,.gen,.prg,.bst,.his 
and .stt) that provide the statistical information of the 
GP run. There are many patches developed by different 
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Table 1: Comparison of the features of Postfix-GP system with other GP systems 


No. 

Features 

lil-gp 

JCLEC 

ECJ 

Postfix-GP 

1 

Genome representation 

Tree 

Tree 

Tree 

Linear string 

2 

GP parameters manage¬ 
ment 

Initialization technique 

Text file (parameter file) 

Configuration file 

GUI configuration file 

GUI 

3 

Full, Grow, Half-and-Half 

Random, Koza 

Random, Koza, PTC1, 
PTC2 

Random, Semantically 
diverse 

4 

Crossover operators 

Sub-tree 

Sub-tree, Tree 

Sub-tree 

Sub-tree, GA-like one- 
Point, Semantic sub- 

5 

Limiting size of solution 

Number of nodes/Depth 
limit 

Number of nodes/Depth 
limit 

Number of nodes/Depth 
limit 

MinLength, MaxLength 

6 

Selection schemes 

Fitness proportional, 

Tournament, Random, 

Best, Worst 

Elitist, Fitness pro¬ 

portional, Tournament, 
Random, MuLambda, 

Boltzman 

Elitist, Fitness propor¬ 
tional, Tournament, 

Random, MuLambda, 

Boltzman 

Archive, Fitness propor¬ 
tional, Tournament 

7 

Adding elements to func- 

C function (function.c, .h) 

Java file, Parameter file 

Java file, Parameter file 

C# Function, Parame¬ 
ter file 

8 

Cascading functionality 

No 

No 

No 

Yes 

9 

Sub-trees as solutions 

No 

No 

Yes (GEP) 

Yes 

10 

GP run statistics 

Report files 

GUI, Text file 

.stat file 

GUI, Text file 

11 

GUI for prediction using 
evolved model 

No 

No 

No 

Yes 

12 

Generic 

No 

Yes 

Yes 

No 

13 

Learning curve 

Small 

Medium 

High 

Small 

14 

GUI for visualization of 
statistical results 

No 

Yes 

No 

Yes 

15 

Architecture 

Procedure oriented 

Object oriented 

Object oriented 

Object oriented 

16 

Programming language 

C 

Java 

Java 

c# 


researchers to fix the bugs and to improve the function¬ 
ality of the basic lil-gp. 

ECJ: ECJ H is a Java based framework for evolu¬ 
tionary computation and genetic programming. ECJ is 
designed using the object-oriented concepts. Classes of 
ECJ framework are divided into four layers [3]: (i) util¬ 
ity layer, (ii) basic and custom evolutionary computation 
layer, (iii) basic and custom genetic programming layer, 
and (iv) problem layer. As the framework is implemented 
in Java, it is slower in speed. Moreover, ECJ uses a tree 
(and not an arrays) to represent an individual, which re¬ 
quires dynamic memory allocation. Thus, the framework 
consumes large memory. ECJ determines GP parame¬ 
ters from a parameter file. ECJ determines classes to be 
loaded, the type of problem to be solved, the type of tech¬ 
nique to use to solve the problem, and the way to report 
the statistical results of the run from the parameter file at 
the run time [3]. This provides easy to extend functional¬ 
ity to the ECJ. ECJ stores statistical information of GP 
run in a text file. Moreover, it provides flexibility to pro¬ 
duce the extra output files through class customization 
but does not provide a GUI to visualize this information. 

JCLEC: JCLEC [ID] is a Java based framework for 
evolutionary computation and genetic programming. JCLEC 
m is designed using the object-oriented concepts. The 
classes of JCLEC framework are divided in three layers: 

(i) system core, (ii) experiments runner (reads an EA 
script file, execute all indicated algorithms and produce 
report files), and (iii) GenLab (a GUI on the top of ex¬ 
periments runner and system core layers, provides func¬ 
tionality to edit the experiment files and to view the GP 
run results) [TO] , GP parameters can be set by the user 


either through GenLab GUI or through an XML (config¬ 
uration) file. However, the structure of configuration files 
is not user friendly. The framework provides the GUI to 
visualize statistical information of GP run. The JCLEC 
framework is easy to extend. 

6 Conclusions 

This paper presented the design and implementation of 
Postfix-GP framework. The implementation details of 
Postfix-GP, including an individual representation, differ¬ 
ent crossover operators, mutations, and selection mecha¬ 
nism were also presented. Postfix-GP provides user inter¬ 
active GUI for performing different activities. The user 
can load training dataset, function set, and constants. 
The user can set the GP parameters through GUI. The 
evolved solutions with their statistical measures can be 
visualized through GUI. Moreover, the user can also per¬ 
form one-step and multi-step predictions using GUI. The 
evolved solutions can be stored in binary format and can 
be retrieved later on. The user can also analyze Postfix- 
GP run through GUI. Postfix-GP as a solution modeling 
tool was presented by solving symbolic regression prob¬ 
lem. Postfix-GP addresses the requirements of ease of use 
and small learning curve before utilizing it to solve the 
problems. It was developed to minimize the user’s time 
required to set up and run GP experiments. 
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